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Abstract 


Finite element Galerkin methods for periodic first order hyperbolic equations exhibit 
superconvergence on uniform grids at the nodes, i. e., there is an error estimate 0(h 2r ) 
instead of the expected approximation order 0(h r ). In this paper it will be shown that no 
matter how the approximating subspace S h is chosen, the superconvergence property is 
lost if there are characteristics leaving the domain. We shall also discuss the implications 
of this result when constructing compact implicit difference schemes. 



1 Introduction 


Error estimates for finite element approximations based on Galerkin methods for 

du 
dt 


= Pu 


are determined by the approximation property of the finite element space. If the approx- 
imation order is r, the convergence rate is 0(h r ). However, for problems with periodic 
solutions and uniform grids there is superconvergence at the grid points. Thomee and 
Wendroff [10] showed that if the differential operator P is of order m, then the error at 
the grid points is 0(h u ), v — 2r — m for m even v = 2r — m+ 1 for m odd. The question is 
whether this superconvergence property remains in the presence of boundary conditions 
for finite computational domains. In this paper it will be shown that the superconver- 
gence property is lost, no matter how the approximating subspace S h is chosen, if there 
are characteristics leaving the domain. We will illustrate the loss of superconvergence by 
considering the simple model problem 


u i — , 0 ^ x ^ oo , 0 ^ i \ 

U(x, 0) = f(x) [ ’ 

and the case r = 2 (for simplicity we introduced only one boundary by considering the 
interval [0,oo)). The scalar product and the corresponding norm are defined by 

r oo 

(u,v) = / u(x)v(x)dx , ||u|| 2 = (u,u), (2) 

Jo 

It is assumed that ||/|| < oo. We shall henceforth only consider real functions. 

The problem (1) has a unique solution (without specifying any boundary condition at 
x = 0) satisfying an energy estimate 

i|K,()|| 2 = -u(0,t)\ (3) 

or, equivalently, 

IW-.OII’ + fu(0,rfir =||/(-)|| 2 . (4) 

Jo 

Let S be the Sobolev space of functions u(x) with \\du/dx\\ < oo. The weak formulation 
of (1) is 

Find u(x,t) £ S such that for every fixed t 

(u t ,v) = (u x ,v) , v65, 

(5) 

u(x,0) = f(x) . 


A grid is defined by Xj = jh, j = 0, 1, . . .. Let Sh be the space of piecewise polynomials 
of degree r — 1 . The Galerkin method can be formulated as follows 
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( 6 ) 


Find u h (x, t) € Sh such that for every fixed t 

(u h t ,v h ) = {u h x ,v h ) , v h <E S h , 
u h {x,0 ) = /'‘(a;). 

An attractive feature of the Galerkin method is that stability follows directly from the 
well-posedness of the continuous problem. In fact, substituting v h = u h in (6) yields 

i\\ur = (7) 


Consider the approximating space Sh of piecewise linear functions. The superconver- 
gence result states that the error is 0(h 4 ) rather than 0(h 2 ). This is easily seen if we 
consider the hat functions as a basis of 5^, where 


<Pi{z) 


a 

h 

0 , otherwise, 


(p 0 (x) 


— — (x — zi) , 0 < x < xi , 

h 

0 , otherwise. 


Since <fi(xj) = Sij and 

oo 

u (x,t) = u j{t) i Pj{ :1 ') ) 

i=o 


the Galerkin method (6) can be viewed as a difference method for uy 


( 8 ) 


( 9 ) 


Id, \ 

— — [Uj+\ + 4uj + u j—i) 


^j+i U j— i 

2 h 


1 , 2 ,... 


( 10 ) 


Id. x Uo /i i \ 

3* (2 “° + “ ,) = ~ ' T~ ' ( 1 

The superconvergence result holds for problems with periodic solutions, where (11) is 
replaced by the periodicity condition Uj = Uj+w. In this case the Galerkin method (10) is 
the well known fourth order compact difference approximation (proposed by Kreiss, see 
[9]). However, equation (11), applied in the non-periodic case, is only first order accurate, 
and thus, as will be shown in the next section, the global error estimate is second order 
only. 

In section 3 we shall prove that there is no way to modify the approximating space 
near the boundary such that the fourth order accuracy is retained. In fact, the boundary 


f 
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condition (11) is optimal in the sense that any choice of boundary functions leads to only 
second order accuracy at most. 

The equations (10), (11) can be viewed as a difference approximation to (1). Indeed, 
this interpretation was used by Thomee and Wendroff when deriving the superconvergence 
result. Such methods are often called implicit compact difference approximations or 
Pade approximations. Compared to standard explicit approximations of the same order, 
they have a considerably smaller error coefficient, making them competitive with explicit 
schemes, despite the fact that they are implicit. 

By leaving the Galerkin formulation, there is greater flexibility when modifying the 
approximation near the boundary. Carpenter et. al. [1] constructed a stable and third 
order accurate boundary modification for (11) that results in fourth order global accuracy. 
However, the condition (7) is no longer fulfilled, and it is shown that there is a growth in 
time (independent of h since the scheme is stable). 

The semi-discrete compact scheme can be written in the form 

pd i = < 12 > 

where u is the vector of unknowns uj, P and Q are band matrices, P symmetric positive 
definite (SPD). Asymptotic stability (also called time stability or strict stability) in the 
norm ( u , Pu)h follows from 

(.Mi Qu)h = 2 U 0 ■ 

where the discrete scalar product (•, •)/, and the corresponding norm || • ||^, are defined by 

CO 

{«,!>}* = XX v jh, ||m||& = {M,u)h- (13) 

3=0 

This is the same condition as (7). It is convenient also for difference methods when applied 
to systems, since it allows for stable implementations of physical boundary conditions in 
a direct way, see for example [2] and [8]. 

It follows directly from the negative result for Galerkin methods that there is no way 
of modifying P, Q near the boundary with our conditions above satisfied, as long as P has 
to be SPD. The question then arises whether there is another norm (u, Pu)h such that 
our conditions can be fulfilled with new P and Q in (12). We shall prove in section 4 that 
this is impossible unless P is allowed to be different from P at interior points. 

Recently, Carpenter et. al. [2] have shown that the conditions can be fulfilled if P is 
modified also at inner points. In the case of the fourth order scheme with tridiagonal P 
at inner points, the new matrix norm is pentadiagonal at inner points. For explicit high 
order approximations it is possible to construct simpler norms and still obtain asymptotic 
time stability, see the work by Kreiss and Scherer [4, 5] and by Olsson [6, 7, 8]. Wahlbin 
[11] has discussed a different concept of superconvergence for elliptic problems, where the 
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error in the derivative is of the same order as the function itself. It should be emphasized 
that the concept of superconvergence that we are discussing refers to the error estimates 
of the function itself. 


2 Necessary Conditions for the Accuracy Near the 
Boundary 


In [3] it is proved that for certain classes of difference schemes it is possible to lower the 
accuracy one order at the boundary without loosing the convergence rate defined by the 
interior scheme. We shall prove in this section that the accuracy near the boundary cannot 
be decreased any further, if we want to retain the interior convergence rate. Consider a 
semi-discrete approximation of any linear first order system 


_ d.Uj 1 _ - n i 

P3 ~dt = h Q3U3 ' 


(14) 


where the difference operators Pj, Q-j satisfy Pj = P and Q-j — Q for j > s. The error 
e-j(t) = uj(t) — u(xj, t) then obeys 


„ de; 1 ^ „ . 

Pj ^ T Qj i J 0, 1, ... . 

where the truncation error gj(t) is a smooth function of t that satisfies 


(15) 


\9M = 


f 0(h*), i = o , i ,...,5 — i 


0(h r ) , j > s 


It is assumed that q < r. Suppose that the initial data are exact, i. e., £j(0) — 0, j > 0. 
Define u T = (u 0 . . . ), v T = (u( 0, t )...), g T = (g 0 . . . ), and e = u - v. Here v denotes 
the vector whose elements consist of the analytic solution at the grid points Xj. The error 
equation (15) can then be expressed as 


where 


4 = 


iw-drso*- 


(16) 


(17) 


Lemma 2.1 Suppose that |^(t)| > K 0 h q for some j, 0 < j < s - 1 and 0 < T 0 < t < T u 
where Kq is independent ofh,t. Then the solution eft) of equation (16) satisfies 


max \\e(t)\\ h > K x h q ^ 2 

0 <KTi 11 v 7 


( 18 ) 


for h < 1, where 


is defined by eq. (13). 


f 
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Proof 

Integrating equation (16) with respect to t gives (recall that e(0) = 0) 


where 


Hence 


where we have used 


Pe(t) = -Q J e(r )dr + G(t) , 0 < t < T x , 

G{t) = I g{ T )dr . 

Jo ~ 

IlfZWIU < (lli’ll* + jllQIU) max lUWIIa. 
II /V)*1U< /‘|| £ (r)|UiT 

Jo Jo 


which follows from Jensen’s inequality and the homogeneity of || • \\k. For h < 1 we thus 
obtain 


max \\e(t)\\ h > 

o<KTi _v y 


IIPIU + r.HOlU 

In particular, the inequality is true for t — T\. Thus 

h . 


l|G(0IU, 0<t<r,. 


max \\e{t)\\ h > 


0 <t<Ti 


IPWk + TMl, 


f gj (T)dT\h 1/2 
Jo 


For convenience it will be assumed that the smooth component gj be positive for To < 
t < Ti. Furthermore, e_,(0) = 0 implies that ^(0) = 0. Thus, we can assume without 
restriction that gj(t) > 0 for 0 < t < T\. Consequently, 


f gj(r )dr \ = / gj{r)dr > T l K 0 h q 
Jo Jo 


Thus, 

The lemma is proved. 


£ iPiu+til51u*' w ‘ 


□ 


Remark: The lemma allows for the possibility of decreasing the accuracy near the bound- 
ary by order 1.5. In reality, however, the order of accuracy must be an integer. Thus, the 
accuracy can be decreased one order near the boundary but not more. Furthermore, the 
extra factor of h 1 / 2 is an effect of the Z^-norm, and would not be present in the maximum 
norm. 


3 Modified Elements Near the Boundary 

We shall prove that there is no modification of the piecewise linear space Sh near the 
boundary such that global fourth order accuracy is retained. 
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Theorem 3.1 There is no local modification near the boundary such that the supercon- 
vergence property is retained with the Galerkin method. Indeed, with piecewise linear 
elements at inner points, the maximum, convergence rate is G(h 2 ). 

Proof 

We first make the assumption that the basis functions of Sh satisfy — $ ij ■ 



Furthermore, in the interior it will always be assumed that the (pi’s are given by (8). Let 
u be the vector with components uj. The Galerkin method (6) can then be expressed as 

p Tt=>- (19) 


Here P and Q are well defined at inner points by (10), and we partition them as 





0 ... 

\ 

"a 

II 

Pl2 \ 

P 22 ) ’ 

P\ 2 — 7 
0 

0 

1 0 . 

... j 


P 22 — 7 


/ 4 1 
1 4 1 

1 1 


( 20 ) 


n — ( ^ 12 

Q - l -QL Q 22 


Q 22 2 


/ 0 1 

-1 0 1 

-1 0 

V 


/ 




/o ... 

0 

V 1 0 


( 21 ) 


The submatrices P n , Q 11 are (s x s)-matrices resulting from the modified basis functions 
near the boundary. The elements of P and Q have been normalized to be of order one, 
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i. e., 

Pij = {}p i i V 3 j ) ! h ) 

<Hi = {?»&*>: fdx). 

Whatever space Sh we are using the relation 

^||u‘(.,i)|| J = -u‘(0,() 2 (22) 

holds. Introducing u h = Y, u j¥j we get 

^- t (u,Pu)h = -t*o, (23) 

where the discrete scalar product is defined by eq. (13). Combining eqs. (19) and 

(23) yields 

{u,j^{Q T + Q)u)h = -«o- ( 24 ) 

that is, Q is almost antisymmetric. Summing up, the assumption that <Pi(xj) = 8ij implies 
that the Galerkin method can be written as a difference method (19) where P is SPD and 
where Q is almost antisymmetric. Consequently, 

Pij = Pji i * i J — 0 5 

(jij = < ]ji j * d~ J > 0 , (25) 

<700 = — 1 / 2 . 


It is enough to consider polynomials when investigating the accuracy, and we choose 
f(x) = (x — (s — l)h/2) r , r = 0, 1, . . .. Without restriction we can assume h = 1. At inner 
points Xi , i > s, the elements p , : , q^j are known. For 0 < i < s — 2 the rows in P have 
the non-zero elements pio , . . . , Pi, s -i and correspondingly for Q. In row s — 1 there is an 
extra element p s -i, s = 1/6 and q 3 -i iS = 1/2, respectively. Thus, the accuracy check has 
the form (where we let n = s — 1 for simplicity) 


= ^2 9*i ( ’ 3 ~ 2) + 2^ ,n ( 2 *) ’ 1 = 0, • • • > n ; r = 0, 1 , . . . 


(26) 


For r = 0 we have by taking (25) into account 

n n j 

^ ] qij — ^ ) qji 8io = f 0, . . . , n , (27) 

j=o j=o 1 

and for r = 1 
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Summing (26) over i gives 


n / nV 1 " r (n V 1 n\ r ,A 1 f n 

r §H) -gO-2)S*' + 2(5 +I 

and by (27), (28) 

r g(i-ir[|:^(-|)-^+Kl +1 )^ 

4(i+>r'=g( j -i)'K + ^) + Ki 


For r = 2 we have 


" " /.. .n .n fn\ 2 \ n n fn \ 1 fn 

2 EE lii b* - i 2 - ’ 2 + (2) ) “ 6 + 2 (2 + V + 3 (2 

j—0 t=0 \ ' 

' ' Ui-) 2 


+ 1 


= -< 2 j + 2 U’ + 


By (25) 


EEw* = °> 


i=o t=0 
n n 


ICEE^'O + *) = °’ 


j=0 i=0 
n n 


EE EE — 2 ■ 

i=0 j=0 ^ 

Thus, with m = n/2, we get from (29) 

ml „ 1 


m 2 — ^ + m 2 + m + — + - = — m 2 + -m 2 + -m 2 + m + - , 


(29) 


which has no solution. Accordingly, the accuracy near the boundary can be at most first 
order, and the theorem follows by lemma (2.1). 

Thus far the theorem has been proved for basis functions satisfying ifi(xj) = 8{j. This 
assumption will now be removed. 
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We write a function u h € Sh as 


DC 


j= 0 


( 30 ) 


and in general u h (x.j^t) ^ G?j(i) near the boundary. If we let a be the vector with 
components ct 3 the Galerkin method gives us 




(31) 


which formally is of the same type as eq. (19). Again, P is SPD with the same structure 
as P in (20). Combining eqs. (22), (30) and (31) gives 


(a, l(Q T + Q)a) K = -u h (0,t) 2 = -ul. 


(32) 


Thus, Q is not almost antisymmetric in the sense of eq. (24). Therefore, the first part of 
the proof cannot be directly applied to (31). Since the constitute a basis in Sh, there 
is a transformation from u to a given by 

a = Tu, (33) 


where T has the form 


/ x • ■ ■ x 


\ 


■ J 

i. e., it is the identity in the interior, and with the upper corner block having a size 
corresponding to the support and number of modified basis functions. The approximation 
u then satisfies 

Pt it = T^ru. (34) 

at h 

If w denotes the grid values of the true solution of the continuous problem, i. e., w t = w T , 
then 

PTw, = \QTw + g, (35) 

a — 

where g is the truncation error. Assume that basis functions are found such that gj = 
0(h 3 ) in a fixed number of points adjacent to the boundary, and g : = 0(h 4 ) in the 
interior. Multiplying (35) by T T from the left we get 



where it follows that T T g is of the same order as g because of the assumption that T T be 
bounded. Using (33) in (32) gives 

(«, Ut t Q t T + T t QT)u)h = -ul , 

a 

that is to say, T t QT is almost antisymmetric. Thus, eq. (36) is third order accurate at 
the boundary, P = T T PT is SPD, and Q = T t QT is almost antisymmetric. But this is 
impossible according to the first part of this proof, and we have arrived at a contradiction. 
This proves the theorem. ^ 


4 Generalized Norms 


We shall next discuss how the results of the previous section can be generalized. To make 
the presentation more concise we introduce the concept of (p, ^-approximation of d/dx. 

Definition 4.1 A difference approximation D is called a ( p,q)- approximation of d/dx 
if D is at least p th order accurate at the grid points Xq, . . . ,x s -i, and at least q th 
order accurate at x s , s is a fixed number independent of the uniform grid spacing 

h — ~~ > 


It has been demonstrated that there exist no matrices P and Q , P tridiagonal in the inte- 
rior and SPD, Q almost antisymmetric, such that P~ l Q is a (3,4)-approximation of d/dx. 
In particular, there exists no Galerkin method that results in a (3,4)-approximation. In- 
terpreting D = P~ l Q as a finite difference operator, however, one can try to construct 
a different scalar product in which it is possible to establish the summation-by-parts 
property, which will be pursued in the following paragraphs. 


Let D = \P~' Q be a (p, ^-approximation of d/dx, where it is no longer assumed that 
P be SPD, nor that Q be almost antisymmetric. This allows us to consider more general 
operators than those of section 3, since P was SPD. The operators P and Q are given by 


P = 



P 12 

Pj 2 


, Q = 


Qn Q 12 A 
— Ql 2 Q22 ) ’ 


(37) 


where P u and Q u are arbitrary (s x s)-matrices; P n must of course be non-singular in 
order for D to exist. The structure of the remaining blocks is 


P12 — 


/ 0 

0 

Pt 1 

V pi 


p 22 


{ Po pi 
Pi po 

P n 


Pri 0 ... ) 


Pri \ 


(38) 


n 


=r 


10 


Q 12 — 


l 0 

o 

<ir 2 

V ?1 


?r 2 0 . . . y 


> <?22 — 


/ 0 <7, 

-91 0 




If D is to be almost antisymmetric with respect to some scalar product (•,■) it is 
necessary that 

(«, Du) = (u, t PP-'Qu) h = - l -ul , (39) 

where P is the SPD-matrix representing the scalar product. It is then natural to try a 
scalar product (u,v) = ( u,Pv)h , where P is obtained by modifying P at the boundary. 
Hence, P may be written as 

/ D d \ 

(40) 


p ( Pn P\2 

l pi ; P 22 


where Ai G R sXs . It is no restriction to assume identical block size of Pn and P\ 1 , since 
it is possible to extend the size of P\\ so as to match that of P\ 1 ; s must of course be 
independent of the mesh size h. 


Define H = PP a , i. e., P — HP, where H is blocked as 

/ H u H 12 \ 


H = 


i?21 H 22 


(41) 


From eq. (39) it follows that HQ must be almost antisymmetric; P = HP requires that 
H P be SPD with the same interior structure as P. Hence, given a (p, ^-approximation 
D = \P~'Q , we must find a matrix H such that 


P22 = i/21 P\2 + i/22/22 
P\ 2 = H\\P \2 + i/12/22 

P \2 ~ i/21 i^ll + H22P12 ( 42 ) 

0 = i /21 Q\2 + H 22 Q 22 + (//2lQl2 + ^ 22 ^ 22 )^ 

0 = i/11 Q\2 + H12Q22 + {H2\Qn “ H 22 Qn) T ? 

which follows from the symmetry of P = HP , the antisymmetry of HQ , and the structure 
assumption (40). The equations above will be used to derive constraints on H. 

The submatrix P 2 2 defines a difference stencil, which in the interior points yields the 
characteristic equation 

p Tl x 2ri + . . . + Po^ 7 " 1 + • ■ • + Pri — 0 (43) 
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with v distinct roots x*., k = 0, . . . , v — 1, each with multiplicity \i k. The general solution 
of the corresponding difference equation may then be written as 

k = 0 


where a*(j) is a polynomial in j 


ak(j) = akij 1 ■ 
1=0 


Furthermore, with each root x k , k = 0, . . . , v- 1, we associate the upper triangular matrix 


( tilif? utf } ... ui k) , \ 


Uk = 


l 00 a 01 
(k) 
«11 


U 


o 

(*) 


(44) 


(fc) , 


where 


U<S = (-9. 1 (-r 2 )'- , "*r’ - ■ ■ ■ - + • ■ ■ + l ) 


(45) 


with 0 < m, / < /xjt - 1. The binomial coefficients are defined to be zero whenever m > l. 
Thus, all matrix elements of U k are defined by eq. (45). 


Assumption 4.1 For each root x k , k = 0, . . . , v — 1, of the characteristic equation (43), 
the associated upper triangular matrix Uk (44) 15 non-singular. 


Lemma 4.1 Let the matrices P and Q be defined by eqs. (37), (38), and suppose that 
assumption ( 4-0 holds. If there exists a matrix H such that Q — HQ is almost antisym- 
metric and P = HP symmetric, P given by eq. (40), then 


H = 


H n 0 

0 7 


/ h 


H u = 


00 


ho,s-r-l 0 • ■ • 0 \ 


h s - r — 1,0 • - • h$ — t — 1,3 — r — 1 


\ ^5— 1,0 ■ • ■ hs— 1,4-r-l 0 • • • 1 / 


where r = max(ri,r 2 ). 
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Proof 

From the first equation of eqs. (42) one obtains the inhomogeneous difference equation 

Pn hij + . . . + Po^tJ+n + • . • + Pr, hij+2ri — Pr, <^J+2r, + • • • + PO^t'J+r, + • • • + Pr , $ij ? (46) 

where 6{j is the Kronecker delta and i > s, j > s — t\. To begin with we consider j > i, 
which implies eq. (43). Hence 


h 




]C a kU) X l, 


k—0 


j > i 


where 


a k{j) 


X] a klj‘ 


1-0 


Next we solve for /i, t by setting j = i in eq. (46). Using the general solution for j > i 
yields 

p-i ^**-1 

Pn X^ 

it=o /=o 

v— 1 

4” Pr, X) «*(*)** + Pn , 

where the expression between the square brackets vanishes, since is a root with multi- 
plicity p*. This implies 

t/-i 

ha ~ ^ T ^ * 

fc =0 

In a similar vein is determined as 

v— 1 

,1 — 1 = ^ y tiki} 


Pr, i f *l + - - - + PD(i + ri)'a:* +ri + - . - + p T1 (i + 2 ^ 


An induction argument shows that 

i/— i 

/*«; = XI a k(j) x l> s-ri<j<i-l. 

k= 0 


Hence 

l/ — 1 

= X a k(j) x i + j > s — r i . (47) 

fc=0 

Next, we make use of the fourth equation of eqs. (42), which implies that the diagonal 
elements of H 2 \Q \2 + H 22 Q 22 are zero, i. e., 

— 9r 2 ^t ,1-7-2 — ... — + qihi,i+\ + . . . + q r2 hi : i + r 2 = 0 . 
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Inserting eq. (47) gives 


U-l Hk~ 1 

o = EE 


fc= 0 /=0 


-q T2 a k i(i - r 2 )V fc r2 - ... - q\a kl (i - l)V fc 1 
+?ia w (i + 1)'4 +1 + • • • + <F 2 a«(* + r 2 )'x ’ fc +r2 . 


The binomial theorem yields 

EE E ««“!»/ *“4 = 0, i>s, 
k = 0 /=0 m = 0 

where we have used eq. (45). Let /i = max&(//&)> and extend the coefficients a^i by defining 
a k ; = 0 whenever flk < l < fi — 1. Similarly, let = 0 for / < m < fi — 1, whence 


2 EE afc/ u m/* m 4 = 0, *>5. 

fc=0 f=0 m=0 

Rearranging the sum one obtains 

i'-i A " 1 ,.A 

53 53 53 a U U ml * m 4 = °» * > 5 • 

77l“0 \ / = 0 f 

The vectors (i m x' k ), i > s, are linearly independent for all k and m. Thus 

53 aw<43 = 0, 0 < A: < 1 / - i, 0<m<^-l. 

j=o 

Since a k i = 0 for fj. k < l < ^ — 1, it suffices to sum from l = 0 to l = (i k — 1. Furthermore, 
when n k < m < \i - 1 it follows that / < m for 0 < / < fi k - 1 , which according to the 
extension of u implies that the sum vanishes identically. Consequently, no conditions 
are imposed for fi k < m < — 1, and we are left with 

Jlfc — 1 

53 a ki u m'i = 0) 0 < k < v — 1, 0 < m < /it — 1 . 

1=0 

Defining a k = (a k0 . . . a fc ,^ fc _ i) T gives the following system 


= 0, 0 < k < v — 1 , 

which only has the trivial solution a k = 0, since U k is assumed to be non-singular. From 
eq. (47) it follows that 

hij - Sij , i > s, j > s - rq . (48) 

In particular, H 22 = I- By means of the third equation (42), H 22 = I leads to H 2 \ = 0, 
since P u is non-singular. 
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To determine H n and H r2 we begin by examining the second equation of eqs. (42), 
which implies 

Pr i + . . . + Pohij +ri + . . . + p Tl h{j+2ri = RHS, 0 < i < S — 1 j > S — Cj , 

where the right hand side (RHS) is defined by 

RHS = {° 0 < i < s - n — 1 

( Pri—n+l^i,j+n—\ T ■ • • T Pr\&ij i = S T] 1 T n, 1 ^ U ^ Tj . 

Arguing exactly as before one obtains 

V— 1 

hij = ^ ^ "P j i 1, J ^ S V\ . 

k = 0 

Finally, we use the last of eqs. (42), which simplifies to 

HnQn + # 12^22 = Qn « 

Using the equations for the diagonal elements we recover 

— q r2 h t)l -r 2 — ... — 1 + • • ■ + <7r 2 ^»,i’+r 2 — 0 ) 

which again yields 


— 5 t j, 0 < i < s — 1, j > s — r \ . 
Hence, H 12 = 0. Furthermore, eq. (49) also shows that 

/ froo • • * ^0,3-n-i 0 ... 0 \ 


Hu = 


h S — Ti—\,0 • • * ^5— ri — 1,5 — 7*1 — 1 


\ 1,0 • * ■ h \ f s~-T\ “1 0 ... 1 J 

This proves the lemma if r\ > r 2 . Suppose that r 2 > r\. Then 

tt n ( H u Qu H U Ql2 \ 

HQ = { -QL Qn J ■ 

Since HQ is almost antisymmetric, it follows that 

HnQ\2 = Q12 • 

Partition 

Hn = ( «“ 1 ) , Qn~- 


Qll Q \2 \ 

<?21 Q22 j 


(49) 


(50) 
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where H u , Q n € R (s - ri)x{s_ri) , and H 2U Q 21 € R r ' x(s r,) . Combining this partition with 
eq. (50) yields 

Q 11 

0 


(£)«"-( 


(51) 


The explicit structure of Q n is 


Ou — 


( 0 

0 

<Jr 2 


... 0 \ 


... 0 


\ 9r,+l ■■■ ?r 2 0 ... 0/ 

where column r 2 — r a — 1 is the last non-zero column. Equating columns r 2 — rj — 1 results 

( hoo . . . ^0,5 — r j — 1 \ / 0 \ 


m 


0 \ 

/ 


\ hs-i,o h s —i tS — Tl —i J 


<lr 2 

0 


V 0 ) 


The element q V2 is located on row s — r\ — 1 in both column vectors, i. e., 

ri— 1 — ti— 1? ^ — J — ^ ^ • 

If r 2 = n + 1 we are done. Otherwise, assume the induction hypothesis hji = for 
0<j<5 — l,Jfc</<3 — ri — 1, where satisfies 5 — r 2 < k < s — ri — 1. Equating 
columns A: + r 2 — 1 — 5 and using the induction hypothesis one obtains 

hj t k - i = 0 < j < 5 - 1 . 

Since the result is true for k = s - n - 1, it follows by the axiom of induction that 

( hoQ . . . ^OjS — 7*2 — 1 0 * ’ ■ 0 \ 


Hn = 


h S — 7*2— 1,0 * • ’ ^5 — T2 — 1 ,S — ^2 — 1 


\ ^ 5 - 1,0 

which concludes the lemma. 


^3 — 1,3— 


r 2 -l 


0 ... 1 / 
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Corollary 4.1 The conclusion of lemma (4-1) holds if the characteristic polynomials of 
P 22 and Q 22 are relatively prime. 


Proof 

The matrices Uk are non-singular iff the diagonal elements are non-zero. But 

u, ( f } = (-?r 2 zjb r2 - . . . - q\xf 1 + qiXk + . • • + <7r 2 z* 2 ) , 0 < / < /x* - 1 , 

(k) 

where Xk is a root of the characteristic polynomial of P 2 2- Thus, u K u 76 0 iff x k is not a 
root of the characteristic polynomial of Q 22 , i. e., iff the characteristic polynomials of P 22 
and Q22 are relatively prime. 1=1 

Since D = \P~ l Q is a (p, (^-approximation of d/dx it follows that 

(Pv)i = 7 (Qu) t + 0(h p ) i = 0,...,s-l 

n 


and 

(Pv) i = ^(Qu) i + 0(h<>) i = s,s + 1,... 

where v = (u x (x 0 ) u x (x 1) ... ) T . The interior order of accuracy q is assumed to exceed 
that of the boundary. It may of course happen that one gets the interior order of accuracy 
at some of the boundary points, if the interior operator extends into the boundary operator 
P 11 ,Qi\. From lemma (4.1) it follows immediately that 


HP 


H\\P\\ P\ 2 
P\2 P22 


, HQ = 


HnQn Q 12 \ 

-QL Q 22 ) ' 


(52) 


Hence, 

(HPv) i = i (HQu), + <D{h p ) i = 0,...,s-l 

and 

(HPv\ = j- ( HQu\ + 0(W) i = s, s + 1, . . . 

i. e., D — is a (p, (^-approximation of d/dx, where P is SPD, and Q is almost 

antisymmetric. 


We now return to the case where P and Q, defined by eqs. (20)— (2 1 ) , are tridiagonal 
in the interior. 


Theorem 4.1 There exists no (3, 4 ) -approximation D = j^P^Q of d/dx, P and Q de- 
fined by eqs. (20)-(21), that is almost antisymmetric with respect to a tridiagonal norm 
P such that P 22 = P 22 ■ 
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Proof 

Suppose there were a (3, 4)-approximation D = ^P~ X Q and a norm P such that (u, Du) 
= (u,PDu)h = —1/2 Ug. The characteristic polynomials of P 22 and Q 22 are given by 
p(a:) = x 2 + 4x + 1 and q(x) = x 2 - 1. Clearly, they are relatively prime. Thus 


PP ~ 1 


= H = 






where 


/ /loo 

- * ^0,5-2 0 ^ 

C A 

1 

O 

► . 1,5 — 2 1 / 


according to corollary (4.1). From the assumptions on D we conclude that D = jrP X Q , 
P = HP, Q = HQ given by eq. (52), is a (3, 4)-approximation of d/dx, where P is SPD, 
and Q is almost antisymmetric. But this contradicts theorem (3.1). D 


It should be pointed out that it is not a priori clear that no such norm can exist. 
If P is defined by eq. (40), with P t2 and P 22 given by eqs. (20)— (21), one cannot apply 
theorem (3.1) directly to P = HP (symmetric) and Q = HQ (antisymmetric), because 
Q = HQ need not be tridiagonal in the interior, which is needed in order to apply the 
basic theorem (3.1). 
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